Read quality-based trimming of the distal ends of public fungal DNA sequences is nowhere near satisfactory

نویسندگان

  • R. Henrik Nilsson
  • Marisol Sánchez-García
  • Martin Ryberg
  • Kessy Abarenkov
  • Christian Wurzbacher
  • Erik Kristiansson
چکیده

DNA sequences are increasingly used for taxonomic and functional assessment of environmental communities. In mycology, the nuclear ribosomal internal transcribed spacer (ITS) region is the most commonly chosen marker for such pursuits. Molecular identification is associated with many challenges, one of which is low read quality of the reference sequences used for inference of taxonomic and functional properties of the newly sequenced community (or single taxon). This study investigates whether public fungal ITS sequences are subjected to sufficient trimming in their distal (5’ and 3’) ends prior to deposition in the public repositories. We examined 86 species (and 10,584 sequences) across the fungal tree of life, and we found that on average 13.1% of the sequences were poorly trimmed in one or both of their 5’ and 3’ ends. Deposition of poorly trimmed entries was found to continue through 2016. Poorly trimmed reference sequences add noise and mask biological signal in sequence similarity searches and phylogenetic analyses, and we provide a set of recommendations on how to manage the sequence trimming problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

AlienTrimmer removes adapter oligonucleotides with high sensitivity in short-insert paired-end reads. Commentary on Turner (2014) Assessment of insert sizes and adapter content in FASTQ data from NexteraXT libraries

In a recent work, Turner (2014) compared the performances of two bioinformatics programs, cutadapt (Marcel, 2011) and AlienTrimmer (Criscuolo and Brisse, 2013), to trim off exogenous oligonucleotides from short-insert paired-end reads. Turner (2014) suggested that AlienTrimmer performed with very low sensitivity. Here we show that this reported lack of performance was due to inappropriate use o...

متن کامل

Atropos: specific, sensitive, and speedy trimming of sequencing reads

A key step in the transformation of raw sequencing reads into biological insights is the trimming of adapter sequences and low-quality bases. Read trimming has been shown to increase the quality and reliability while decreasing the computational requirements of downstream analyses. Many read trimming software tools are available; however, no tool simultaneously provides the accuracy, computatio...

متن کامل

روش‌های تشخیصی بیماری‌های قارچی: از دوره کلاسیک تا عصر مولکولی

Human fungal diseases are largely a 20th and 21st century’s phenomenon. Due to use of corticosteroids and antibacterial drug, medical developmenta are associated with increased risk for number of fungal disease. These nosocomial developments in invasive mycosis were paralleled over the last two decades by the human immunodeficiency virus (HIV) pandemic, which has resulted in an even larger numb...

متن کامل

LUCY2: an interactive DNA sequence quality trimming and vector removal tool

UNLABELLED Lucy2 is a raw DNA sequence trimming and visualization tool based on the popular command-line Lucy1. Users can change parameters, trim multiple sequences and visualize the results within an integrated, easy-to-use graphical user interface. Lucy2 is designed specifically for non-programmers to use, and is currently available on Windows, Linux and MacOS X. Source code is also available...

متن کامل

The Comparison of different Procedures for DNA extraction from paraffin-embedded Tissues: A commercial kit and a traditional method based on heating

Abstract Background and objectives: Paraffin-embedded tissues and clinical samples are a valuable resource for molecular genetic studies, but the extraction of high-quality genomic DNA from this tissues is still a problematic issue. In the Present study, the efficiency of two DNA extraction protocols, a commercial kit and a traditional method based on heating and K Proteinase was compared. Mate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017